Expression of speaker's intentions through sentence-final particle/ intonation combinations in Japanese conversational speech synthesis
نویسندگان
چکیده
Aiming to provide the synthetic speech with the ability to express speaker’s intentions and subtle nuances, we investigated the relationship between the speaker’s intentions that the listener perceived and sentence-final particle/intonation combinations in Japanese conversational speech. First, we classified F0 contours of sentence-final syllables in actual speech and found various distinctive contours, namely, not only simple rising and falling ones but also rise-and-fall and fall-and-rise ones. Next, we conducted subjective evaluations to clarify what kind of intentions the listeners perceived depending on the sentence-final particle/intonation combinations. Results showed that adequate sentence-final particle/intonation combinations should be used to convey the intention to the listeners precisely. Whether the sentence was positive or negative also affected the listeners’ perception. For example, a sentence-final particle ‘yo’ with a falling intonation conveyed the intention of an “order” in a positive sentence but “blame” in a negative sentence. Furthermore, it was found that some specific nuances could be added to some major intentions by subtle differences in intonation. The different intentions and nuances could be conveyed just by controlling the sentence-final intonation in synthetic speech.
منابع مشابه
Expressing Speaker's Intentions through Sentence-Final Intonations for Japanese Conversational Speech Synthesis
In this study, we investigated speaker’s intentions that the listeners perceive through subtly different sentence-final intonations. Approximately 2,000 sentence utterances were recorded and the fundamental frequency (F0) contours at the last vowel of those sentences were classified through one of the standard clustering algorithms. There found various F0 contours, namely, not only simple risin...
متن کاملUsing Interactive Tasks to Elicit Natural Dialogue
Basic research into the relationship between intonation and speaker’s intentions about syntax and information structure addresses whether, when, and how speakers use prosodic information to signal linguistic and paralinguistic meaning. Speakers use prosody for a range of functions in communication: to mark the difference between immediately relevant vs. background information; to express contra...
متن کاملAnalysis of factors involved in the choice of rising or non-rising intonation in question utterances appearing in conversational speech
In general, the end of question utterances is accompanied by a rising intonation. However, non-rising intonation is commonly observed in question utterances appearing in conversational speech. In order to clarify the factors involved in the choice of rising or non-rising intonation, in the present work, we analyzed question utterances extracted from Japanese conversational dialogue speech data ...
متن کاملDevelopment and Evaluation of Online Infrastructure to Aid Teaching and Learning of Japanese Prosody
This paper develops an online and freely available framework to aid teaching and learning the prosodic control of Tokyo Japanese: how to generate its adequate word accent and phrase intonation. This framework is called OJAD (Online Japanese Accent Dictionary) [1] and it provides three features. 1) Visual, auditory, systematic, and comprehensive illustration of patterns of accent change (accent ...
متن کاملGeneration of A ect in Synthesized Speech
When compared to human speech, synthesized speech is distinguished by insu cient intelligibility, inappropriate prosody and inadequate expressiveness. These are serious drawbacks for conversational computer systems. Intelligibility is basic | intelligible phonemes are necessary for word recognition. Prosody | intonation (melody) and rhythm | clari es syntax and semantics and aids in discourse o...
متن کامل